Robust Speech Feature Extraction Using the Hilbert Transform Spectrum Estimation Method
نویسندگان
چکیده
The performance of traditional mel-frequency cepstral coefficients (MFCC) speech feature extraction method decreases drastically in the complex noisy environment. To improve the performance and robustness of speech recognition system, which is based on spectral envelope estimation method, the minimum distortionless response spectrum MVDR-MFCC (Minimum Variance Distortionless Response-MFCC) feature extraction method was proposed. However, the computational complexity of MVDR-MFCC is very high. In this paper, we proposed MHCC (Hilbert-MFCC) feature extraction method for speech, which introduced the Hilbert transform to MFCC process. The experiments, under 8 different noisy environments, indicate that, compared with MVDR-MFCC feature extraction method, the proposed method not only reduces the algorithm’s complexity significantly, but also is less affected by noises, achieving significant improvement in the robustness—the average recognition rate across different noise types and SNRs increases by 12%.
منابع مشابه
Regularized MVDR spectrum estimation-based robust feature extractors for speech recognition
In this paper, we present two robust feature extractors that use a regularized minimum variance distortionless response (RMVDR) spectrum estimator instead of the discrete Fourier transform-based direct spectrum estimator, used in many front-ends including the conventional MFCC, for estimating the speech power spectrum. Direct spectrum estimators, e.g., single tapered periodogram, have high vari...
متن کاملRobust Feature Vector Set Using Higher Order Autocorrelation Coefficients
In this paper, a feature extraction method that is robust to additive background noise is proposed for automatic speech recognition. Since the background noise corrupts the autocorrelation coefficients of the speech signal mostly at the lower orders, while the higher-order autocorrelation coefficients are least affected, this method discards the lower order autocorrelation coefficients and uses...
متن کاملImproved MFCC Feature Extraction Combining Symmetric ICA Algorithm for Robust Speech Recognition
Independent component analysis (ICA), instead of the traditional discrete cosine transform (DCT), is often used to project log Mel spectrum in robust speech feature extraction. The paper proposed using symmetric orthogonalization in ICA for projecting log Mel spectrum into a new feature space as a substitute in extracting speech features to solve the problem of cumulative error and unequal weig...
متن کاملMVDR based feature extraction for robust speech recognition
This paper describes a robust feature extraction method for continuous speech recognition. Central to the method is the Minimum Variance Distortionless Response (MVDR) method of spectrum estimation and a feature trajectory smoothing technique for reducing the variance in the feature vectors. The above method, when evaluated on continuous speech recognition tasks in a stationary and moving car, ...
متن کاملFeature extraction from higher-lag autocorrelation coefficients for robust speech recognition
In this paper, a feature extraction method that is robust to additive background noise is proposed for automatic speech recognition. Since the background noise corrupts the autocorrelation coefficients of the speech signal mostly at the lowertime lags, while the higher-lag autocorrelation coefficients are least affected, this method discards the lower-lag autocorrelation coefficients and uses o...
متن کامل